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Abstract — For a family/sequence of Space-Time Block Codes 
(STBCs) Ci,C2,..., with increasing number of transmit an- 
tennas Ni, witli rates Ri complex symbols per channel use 
(cspcu), i — 1,2,..., the asymptotic normalized rate is defined as 
linii^oo A family of STBCs is said to be asymptotically-good 
if the asymptotic normalized rate is non-zero, i.e., when the rate 
scales as a non-zero fraction of the number of transmit antennas, 
and the family of STBCs is said to be asymptotically-optimal if the 
asymptotic normalized rate is 1, which is the maximum possible 
value. In this paper, we construct a new class of full-diversity 
STBCs that have the least maximum-likelihood (ML) decoding 
complexity among all known codes for any number of transmit 
antennas A'^ > 1 and rates R > I cspcu. For a large set of {R, N) 
pairs, the new codes have lower ML decoding complexity than the 
codes already available in the literature. Among the new codes, 
the class of full-rate codes (i? = TV) are asymptotically-optimal 
and fast-decodable, and for N > 5 have lower ML decoding 
complexity than all other families of asymptotically-optimal, fast- 
decodable, full-diversity STBCs available in the literature. The 
construction of the new STBCs is facilitated by the following 
further contributions of this paper: (i) For g > 1, -we construct 
g-group ML-decodable codes with rates greater than one cspcu. 
These codes are asymptotically-good too. For g > 2, these are the 
first instances of g-group ML-decodable codes with rates greater 
than 1 cspcu presented in the literature, (ii) We construct a new 
class of fast-group-decodable codes (codes that combine the low 
ML decoding complexity properties of multigroup ML decodable 
codes and fast-decodable codes) for all even number of transmit 
antennas and rates 1 < i? < 5/4. (iii) Given a design with full- 
rank linear dispersion matrices, we show that a full-diversity 
STBC can be constructed from this design by encoding the real 
symbols independently using only regular PAM constellation^]. 

I. Introduction 

Consider an N transmit, N,. receive antenna quasi-static, 
Rayleigh flat-faded MIMO channel Y ^ XH + W, where X 
is the T X N transmitted codeword matrix, H is the N x Nr 
channel matrix, W is the T x Nr noise matrix and Y is 
the T X Nr matrix of received signal. A design in K real 
symbols xi,...,xk for N transmit antennas is a matrix 
X = X]t=i ^i^i' where the linear dispersion or weight ma- 
trices Ai e C^^^, i = 1, . . . , if, and the set {Ai, ... , Ak} 
is linearly independent over M. The rate of the design is 
^ ~ W ^^P^^- We say that the design is of high-rate if 
R> 1 and full- rate if R — N. An STBC can be constructed 
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from a design X by making the real symbols take values 
from a finite subset A C M^. The finite set A is called 
the signal set. Denote the resulting STBC by C(X.,A), i.e., 

CCX^A) ~ {J2f=i0.i^i\[<^iT ■ ■ tO'k]^ ^ -M- For ^^y non- 
empty subset r C {1, . . . , if}, let x{r) ^ {xi\i e T}. An 
STBC C(X,^) is said to be g-group ML decodable, g > 1, 
or multigroup ML decodable if there exists a partition 
Fi, . . . , Fg of the set {!,..., K} such that the symbol groups 
x(Fi), . . . , x{Tg) can be ML decoded independently of each 
other. If \Tj\ — K/g for all j = 1, . . . , g, we say that the 
multigroup ML decodable code is balanced, else the code is 
said to be unbalanced. All the multigroup ML decodable codes 
considered in this paper are balanced. It is known IT], ID, ||3l, 
that if the weight matrices corresponding to symbols in dif- 
ferent groups are Hurwitz-Radon orthogonal, i.e., 

A^Ag + AfAp = for p G F„„ g G T^, 1 < m < n < g, 

and if the symbol groups a;(Fi), . . . , a;(Fg) are encoded inde- 
pendently, then the STBC C{X,A) is .g-group ML decodable 
with respect to the partition Fi,...,Fg. For such codes, if 
M is the size of the underlying complex constellation, then 
the ML decoding complexity is only Ylj=i ^ instead of 

Ai^J = l 2 . 

An STBC C(K,A) is said to he fast-decodable |4| (FD) or 
conditionally- g-group ML decodable, g > 1, if there exists a 
non-empty subset F C {!,..., K}, such that for each of the 
values that a;(F^) assumes, the remaining symbols x(r) can be 
g-group ML decoded conditioned on the value of 2;(F^). Here, 
F^ is the complement of F in the universal set {1, . . . , K}. If 
the conditional ML decoding groups in F are balanced, then 
the ML decoding complexity is only Ai 2 . gM instead 
of Ai 2 ' . An STBC C{X,A) is said to he fast -group- 
decodable [5 1 (FGD) if the STBC is multigroup ML decodable 
with respect to a partition Fi, . . . , Fg, g > 1 of {!,..., K} 
and atleast one group of symbols a;(Fj), 1 < j < g, is 
fast-decodable. Such a code combines the low ML decoding 
complexity properties of FD and multigroup ML decodable 
codes. 

Let {Ci,C2,...} be an infinite family of STBCs for in- 
creasing number of transmit antennas Ni,N2, . . . with rates 
i?i , i?2 , • • ■ respectively. 

Definition 1: The asymptotic rate of the family of codes 
{Ci, C2, . . . } is defined to be the limit, lim^^oo Ri, if it exists. 



Definition 2: The asymptotic normalized rate of the fam- 
ily of codes {Ci , C2 , . . . } is defined to be the limit, 
r = limi_>oo if it exists. In other words, r is the limit 
of the ratio of rate to full-rate of the given sequence of codes. 

Definition 3: The family of codes {Ci,C2, ■ ■ ■} is said to 
be asymptotically-good if its asymptotic normalized rate, r is 
non-zero, else the family of codes is said to be asymptotically- 
bad. 

Definition 4: The family of codes {Ci ,€2, ■ ■ ■} is said to 
be asymptotically-optimal if its asymptotic normalized rate, r 
is equal to 1, the maximum possible value. 

A family of codes with zero asymptotic rate is necessarily 
asymptotically-bad and a family with non-zero asymptotic 
rate may still be asymptotically-bad. In other words, an 
asymptotically-good family of codes has non-zero asymptotic 
rate, but the converse need not be true. The following examples 
as well as Table U illustrate this. 

Example 1: Families of multigroup ML decodable codes 
with zero asymptotic rate: STBCs from orthogonal de- 
signs |l6l-|[8|, satisfy g = K and hence offer the least ML 
decoding complexity. In [8], maximal rate square complex 
orthogonal designs were given for number of antennas that are 
a power of 2. The rate for 2™ antennas is ^^Ir-. Single complex 
symbol ML-decodable codes, i.e., codes with g ~ K/2, were 
given in IS, ||9l, HOl, HI]. For 2" antennas, the rate of 
the single complex symbol decodable codes is 2^^- Since 
the rates of orthogonal designs are low, quasi-orthogonal 
designs lfT2i were proposed with higher rate, although with 
higher ML decoding complexity. All these families of codes 
have asymptotic rate zero and hence are asymptotically-bad. 

Example 2: Families of multigroup ML decodable codes 
with non-zero asymptotic rate but are asymptotically-bad: 
In Q, maximal rate orthogonal designs, not necessarily 
square, were constructed for arbitrary number of antennas. 
For number of antennas TV — 2m and 2m — 1, m > 1, the 
codes constructed in [7] have a rate of In H], 121, the 

framework for multigroup ML-decodable codes was given. 
In |[T], delay-optimal 4-group ML-decodable, rate-one codes 
were constructed for even number of antennas. In [3|, codes 
for 2™ antennas with g — 4 and rate one were given 
based on Extended Clifford Algebras. Also in |l3l, Precoded 
Coordinate Interleaved Orthogonal Designs (PCIODs) having 
g — 4, R — 1 were given for even number of antennas in 
the setting of cooperative communications. Four-group ML- 
decodable, rate one codes for 2™ antennas were also given 
in L13J. Multigroup ML-decodable STBCs based on Clifford 
Unitary Weight Designs (CUWDs) were introduced in lfT4l . 
Low ML-decoding complexity STBCs that use Pauli matrices 
as the weight matrices were given in |15|. The g-group ML 
decodable codes from CUWDs |3| and from jlSl can achieve 
a rate of ^ ^ cspcu, for arbitrary g and number of transmit 

antennas N = 2"\ m> [f - 1]. 

Example 3: All the known families of asymptotically-good 
multigroup ML decodable codes: A 2-group ML-decodable, 
rate 5/4 code for 4 transmit antennas was reported in Iil6i . 



In [TTl, 2-group ML-decodable, delay-optimal codes for 
N = 2™ antennas, to > 1, were given based on Clifford 
Algebras. The code obtained in [TTJ for 4 antennas is equiv- 
alent to the code in lfT6l . The codes in ifTTl have a rate of 
^ + cspcu. This class of codes is asymptotically-good, 
having r = j. Recently, unbalanced and balanced high-rate, 
2-group ML-decodable codes were reported in ifTSl for any 
number of transmit antennas N. The class of balanced codes 
in [18J are not delay-optimal and they require T > 2N. These 
codes achieve a rate to full-rate ratio ofi?/iV=l — ^ + 
Thus, by increasing the delay T appropriately with N, an 
asymptotic normalized rate of 1 can be achieved, and hence 
these codes are asymptotically-optimal. The family of codes 
in (17) and (IE] are the only asymptotically-good multigroup 
ML-decodable codes reported in the literature so far. 

Example 4: All known families of asymptotically-optimal, 
full-diversity codes with low ML decoding complexity: In [151, 
low ML decoding complexity, full-diversity STBCs with cubic 
shaping property |fT9l were constructed for N — 2™ via 
codes over the finite field GF(4). The class of full-rate codes 
constructed in IfTSlI for 2"\ to > 1, antennas are conditionally 
4-group ML decodable, information-lossless and constitute a 
family of asymptotically-optimal FD STBCs. Until this paper, 
the FD codes of |15| had the least known ML decoding 
complexity for i? > -| + and TV = 2", m > 1. In ||20| . 
a family of full-diversity, full-rate STBCs called Threaded- 
Algebraic Space-Time (TAST) codes were constructed for all 
number of transmit antennas > 1. These codes are condi- 
tionally 2-group ML decodable and hence constitute a family 
of asymptotically-optimal FD codes. The fast-decodability 
property of TAST codes, which we prove in Section |V] of 
this paper, was not reported in ll20l . 

In ||2T|. delay-optimal, full -rate and large coding gain codes 
were constructed from division algebras for arbitrary number 
of antennas. The asymptotic normalized rate of this family 
of codes is the maximum-possible value 1 and hence are 
asymptotically-optimal. However, these codes are 1-group 
ML-decodable, i.e., are not multigroup ML-decodable. The 
notion of fast-decodability was introduced in [4\. Rate 2 cspcu 
STBCs using designs with largest known coding gain were 
constructed for 2 and 4 antennas in 1221 . These codes are 
fast-decodable as well. It was shown in l22l and l23l that the 
Golden Code ll24l is fast-decodable. The Silver code, a rate- 
2 FD code for 2 transmit antennas, was found independently 
in l25l and l26l . In 1^|, FD codes, called Embedded Alamouti 
Space-Time codes, were constructed for N — 2,4,6,8 and 
R = 1, . . . , N/2 cspcu. A rate 3/2 cspcu FD code for 4 an- 
tennas was constructed in l28]| , and in l29l a rate 2, 4 antenna 
FD code was constructed using Crossed Product Algebras. 
The 2 antenna code of f22l and the codes in 113-123, HI 
have non-vanishing determinant property. FGD codes were 
introduced in Q, where a rate 17/8 cspcu, 4 antenna FGD 
code was constructed. In fT5l|, FGD codes were constructed 
for 2™ antennas and rates 1 < i? < 5/4. 

The contributions and organization of this paper are as 
follows. 



We give a general procedure for constructing full- 
diversity, multigroup ML-decodable, delay-optimal 
codes. Using this procedure, we construct asymptotically- 
good, 5-group ML decodable, delay-optimal, full- 
diversity STBCs for g > 1. For g > 2, the constructed 
class of STBCs contain the first instances of g-group 
ML decodable codes with rates greater than 1 cspcu and 
are the first instances of asymptotically-good g-group 
ML decodable codes reported in the literature. The 
class of 2-group ML decodable codes constructed in 
this paper match in terms of rate, the only other known 
class of asymptotically-good, delay-optimal 2-group 
ML-decodable codes, that were given in IITtII . Moreover, 
the proposed class of codes is for all N > 1, whereas 
the codes in ifTTI are only for N = 2™, m > 1. 
Specifically, we construct asymptotically-good, full- 
diversity, delay-optimal g-group ML decodable codes, 
5 > 1, for TV = n2LT^J 
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antennas, n > q, with rate 

2 

+ ^2lv^ cspcu, where !{•} 



is the indicator function (Section HiH). 
For g > 3, we propose g-group ML-decodable codes with 
the best-known asymptotic normalized rate. For g = 2, 
we construct codes that match the rates in ifTSil when the 
delays are same. Specifically, for g > 1, we propose non- 
delay-optimal codes with N ~ n2^^~^K n > 1, T = gN 
and R = i^^^ + The constructed codes have rate 
greater than one for N > 2^-^ (Section HiHi. 
We construct a new class of full-diversity, FGD codes for 
N = 2m, TO > 1, antennas and rates 1 < i? < 5/4, which 
can be ML decoded with complexity SA/t^"*^"^^" 2 . The 
new class of FGD codes match, in terms of ML decoding 
complexity, the class of FGD codes that were constructed 
in 1115] for the same range of rates but for a smaller set 
of number of antennas N = 2™, to > 1 (Section HvTi. 
Using the new asymptotically-good multigroup ML de- 
codable codes and FGD codes, we construct a new class 
of delay-optimal, full-diversity STBCs that have the least 
ML decoding complexity among all known codes for all 
iV > 1 and R > 1. For a large subset of {R,N) pairs, 
the new codes have lower ML decoding complexity than 
the codes already available in the literature. Among the 
new codes, the class of full-rate codes are asymptotically- 
optimal and fast-decodable, and for > 5, have lower 
ML decoding complexity than all other known families of 
asymptotically-optimal, full-diversity, FD codes (Table HiH 
in Section |V] gives a summary of comparison of ML 
decoding complexities of the new codes with other codes 
in the literature). In particular, whenever N > 6, N not 
a power of 2 and i? > 1, or iV > 6, a power of 2 
and R > 3/2, the new codes have lower ML decoding 
complexity than the codes available in the literature. For 
A^ a power of 2 and 1 < i? < 3/2, the new codes have 
lower ML decoding complexity than already known codes 
when A is sufficiently large. For all other {R, N) pairs, 
the new codes match the existing codes in the literature 



in terms of ML decoding complexity (Section IVb. 
• Given a delay-optimal design with full-rank weight ma- 
trices, we show that a full-diversity STBC can be con- 
structed from this design by encoding the real symbols 
independently, using only regular PAM constellations 
(Section HB. 

Directions for future work are discussed in Section [Vl] 
The new asymptotically-optimal FD codes of this paper for 
A > 4 antennas have lower ML decoding complexity than 
the asymptotically-optimal FD codes constructed in |15|. But, 
the codes in lITSl possess additional desirable properties such 
as cubic shaping and information-losslessness which the new 
codes of this paper do not enjoy. 

Notation: For a complex matrix A the transpose, the conjugate 
and the conjugate-transpose are denoted by ,A and 
respectively. A®B is the Kronecker product of matrices A and 
B. In is the n X n identity matrix and is the all zero matrix 
of appropriate dimension.. Cardinality of a set F is denoted 
by |r| and i = T. For a square matrix A, det{A) is the 
determinant of A. For square matrices Ai, . . . , Ad, d > 1, 
diag{Ai, . . . ,Ad) denotes the square, block-diagonal matrix 
with Ai,...,Ad on the diagonal, in that order. 1{ } is the 
indicator function. 

II. Full-diversity using regular PAM 

CONSTELLATIONS 

In this section, we show that given a delay-optimal design X 
with full-rank weight matrices, we can create another design 
X', whose weight matrices are scaled versions of those of 
X, such that the STBC obtained from X' by encoding the 
symbols with a regular PAM constellation is fully diverse. 
If the scaling coefficients belong to M this is equivalent to 
encoding each symbol of X with a regular PAM of possibly 
different minimum distance. In fTSl . it was shown that a full- 
diversity STBC can be obtained from a design with full-rank 
weight matrices by encoding the real symbols independently, 
but not necessarily using structured constellations, such as 
regular PAM. Further, the constellation used by each symbol 
in IIT5I is possibly different. In this case, in order to map 
the input information bits to symbols, a lookup table of 
length equal to the size of the constellation is required for 
each of the real symbols. However, each real symbol in 
the new design X' is independently encoded with the same 
regular PAM constellation, which can be effected by using a 
binary to decimal converter and thus leading to low encoding 
complexity. 

Consider an N x N design X = X^ili ^i^^ such 
that each A. 

a — [ai,a2, ■ ■ ■ , aK\ G 



i = 1, . . . , AT, is invertible. For any 
^, let X^ = ^^iX,A^, where 
. , K. For any subset 5 C C, let 



~ aiAi, for i — 1, . 
AS = {a-b\a,beS}. 

Theorem 1: Let *B be any infinite subset of C and A' be any 
finite subset of K. Given an A^ x A^ design X — X^iLi xiAi 
with full-rank weight matrices Ai, there exists an a G 55^ 
such that the STBC C(X^,^' x • • • x y^') has full diversity. 



Proof: Proof is via induction on K. We first prove the 
result for K ~ 1 and then prove the induction step. 

When if = 1, we have X — xiAi. Choose ai to be any 
non-zero element of *8. Then — xiaiAi. Since Ai is full- 
ranked, it is clear that the STBC C{X'^,A') is fully diverse. 

We now prove the induction step. Let K > 1, 
X = XiA, and a e be such that the STBC 

Ck = C{X'-,A' X - --x A') is fully diverse. Let 
X ^ '^f^^ XiAi, where Ak+i is invertible. We now 
show that there exists an ax+i G *B such that the STBC 
Ck+i = C(X^,^' X • • • X is fully diverse, where 

a — [cP" aK+iY ■ Consider the difference of any two 
codeword matrices in Ck+i, say 5C. Then, 5C = SCa + SCt, 
where 5Ca is a difference of two codewords in Ck, not 
necessarily different, and 6Cb S {(uk+iAk+iIC G A^'}. 
There are only a finite set of values that SCi, can assume. 
The condition SC ^ leads to the following cases. 

1. 6Ca 7^ and SCb = : In this case, 5C = SCa, which is 
a codeword difference matrix of the fully diverse STBC Ck- 
Hence SC is invertible. 

2. SCa + and SC^-h^ : In this 
case SC = SCa + Co/k+iAk+i- Let 
fsCaXi^) = det{SCa + (zAk+i) G C[z]. fsc^xi^) is 
not identically zero since fsCa-ci^) = det{SCa) 7^ 0. For SC 
to be full-rank, aK+i should not be a zero of fsCa-ci-^)- 

3. SCa = and SCb 7^ 0: In this case SC — Cc^k+iAk+i 
where C, £ AA' \ {0}. SC will be of full-rank as long as 
aK+i ^ 0. 

From the above three cases it is clear that Ck+i will be 
fully-diverse as long as ok+i is not a root of any of the 
polynomials fsCa. ci^) ^'^'^ ctK+i 7^ 0. Each of the polyno- 
mials is non-zero and hence has only a finite set of roots 
in C. Further, there are only a finite number of polynomials 
fsCaXi^)- Hence, the set of values corresponding to aK+i in 
C not leading to full diversity is finite. Thus, the set of values 
corresponding to aK+i in *B C C not leading to full diversity 
is also finite. Since *8 is an infinite subset of C, there exists 
an aK+i € such that Ck+i is fully diverse. This proves 
the induction step. ■ 

Let Aq —PAM C M denote the regular Q-ary PAM constella- 
tion with zero centroid and unit minimum Euclidean distance. 

Corollary 1: Let *8 be any infinite subset of C and Q be 
any positive integer Given an x design X = X^i^i ^^i^i 
with full-rank weight matrices Ai, there exists an a S 
such that the STBC C{X'^,Aq^pam x ■■■ x Aq-pam) is 
fully diverse. 

Proof: Follows from Theorem [T| on substituting A' with 
Aq-pam- ■ 

For d > Q, let Ad.Q denote the regular Q-ary PAM constel- 
lation with zero centroid and minimum Euclidean distance d. 



Corollary 2: Given a positive integer Q and an x 
design X — X^ili ^i^i with full-rank weight matrices 
Ai, there exist d^ > 0, i ^ I, . . . , K, such that the STBC 



Proof: Let *B = G M|e > 0} and A' = Aq-pam- 
From Theorem [T] there exists d — [di, . . . , cZa']"^ G such 
that the STBC C{X'^, Aq-pam x - -■ x Aq-pam) is fully 
diverse. But this STBC is same as C (X, Adi,Q x - - - x Ad^.g)- 
The desired result follows. ■ 
We now give some examples with the help of STBCs 
already known in the literature. 

Example 5: In [28l, an FD STBC for A^ = 4 antennas and 
i? = i cspcu was constructed using full-rank weight matrices. 

XjAi. From Corollary |2l 
such that 



Denote this design by X = J2i=i 
for a fixed Q, there exists a [di, - . 
C{X,Adi,Q X ••• X Adi2,Q) is fully-diverse. In |28|, it was 
shown that the choice di = 1, i = 1, . . . , 12, provides full 
diversity for Q = 2 and Q = A. 

Example 6: The 2x2 Coordinate Interleaved Orthog- 
onal Design (CIOD) |9| is constructed using the design 
xi + 1x2 
x^ + ix4 
from square-QAM rotated by an angle tp = ^tan~^{2) rad. 
The STBC can be equivalently obtained from the design 
X = Y.i=i Xi^i, where 



Here, xi+ix^ and X3+ix2 take values 



A, = 

^3 = 



costp 

isirup 

isinif 

costp 



^2 = 
A4 = 



—sinp 
icosp 

icosp 
—sinp 



(1) 



and by encoding each real symbol independently from 
Aq-pam- Since each of the Ai, i — 1, ... ,4 is full rank, 
it follows from Corollary |2] that for a given value of Q, there 
exist di, . . . ,d4 such that the signal set Ad^.Q x ■ ■ ■ x Ad4,,Q 
leads to a full-diversity STBC. 

Proposition 1: For the design ([T]i, for any positive integer Q 
and any d = [^1,^2,^3,^4]^ G {Z^\di > 0,i = 1, . . . ,4}, the 
resulting STBC C(X, Adi,Q x - - - x Ad^^g) h^s full diversity. 

Proof: It is straightforward to show that the 
set of codeword difference matrices of the STBC 
C{X,Adi,Q X ■ ■ ■ X AdK,Q) is a subset of the set 
of codeword difference matrices of the STBC 
C{X, Ai.mQ X ■■■ X Ai.rnQ), whcrc m — did2d^di. It is 
sufficient to show that the STBC C(X, Ai^mQ x - ■ ■ x Ai^mg) 
has full diversity. Note that this STBC is same as the 2x2 
CIOD whose complex symbols take values from rotated 
(m(3)^-ary QAM. This code, however, is known to offer full 
diversity ||9l. ■ 

Example 7: In 1221 . a fast-decodable rate 2 code for 2 
antennas was constructed using the design X = Yl\=i ^i^i, 
where , . . . , A4 are as given in ([T]) and 



As 



A7 






isinp 


cosp 








cosp 


isinp 






Ae^ 



As^ 





— sinp) 


icosp 



icosp 


—sinp 




C{X,Ad„ 



Ad^.Q) is fully diverse. 



with p — ^tan ^(2) rad- Note that all the weight ma- 
trices are of full rank. Let ^ = {e'^lO e [0,27r)} be the 
unit circle in the complex plane. From Corollary [T] there 



exist Ui, i — 1, ... ,8 such that, e ?B and the STBC 
C{X'^,Aq-pam X ••• X Aq-pam) has full diversity, for 
a given positive integer Q. The code obtained in [22] is 
the above STBC with a = [1, 1, 1, 1, e''^ e'^, e''',e'^f, where 
9 = J. In f22l|, 6 was optimized for diversity and coding gain. 
This code also has non-vanishing determinant property. This 
suggests that for a design with full-rank weight matrices, there 
may exist an a that leads to full diversity for any value of Q. 

The ML decoding complexity of an STBC C(X, A) is deter- 
mined by both the design X and the signal set ^ O. For some 
1 < p < q < K, even if the weight matrices Ap and Ag are 
Hurwitz-Radon orthogonal, but if the choice of A is such that 
Xp and Xq have to be jointly encoded, then the two symbols 
need to be necessarily jointly ML decoded. The following 
definition can be used to denote the contribution of the choice 
of weight matrices towards ML decoding complexity of an 
STBC. Let, for any design X = X^iLi ^i^i ™d non-empty 
subset r C {1, . . . , K}, Xr denote the design X^ier ^i^i- 

Definition 5 ( [IS]): Consider a design X = X^ili ^i^i- 

1) X is said to be (7-group ML decodable if there exists 
a partition of {1,...,K} into g non-empty subsets 
Fi, . . . , Fg, such that 

A^Ai+A^Ak = for / G Fj, fc e Fj, l<i<j< g. 

2) X is said to be fast-decodable if there exists a non- 
empty subset F C {1, . . . , K} such that the design Xr 
is g-group ML decodable for some g > 1. 

3) X is said to be fast-group-decodable if X is (7-group 
ML decodable with respect to the partition Fi, . . . , Fg, 
and there exists an i, 1 < i < g, such that Xp; is fast- 
decodable. 

If X is a multigroup ML decodable, FD or FGD design, then 
the STBC obtained from X by encoding the real symbols 
independently of each other is a multigroup ML decodable, FD 
or FGD STBC. All the new designs in this paper have full-rank 
weight matrices. From Corollary |2l full-diversity STBCs can 
be constructed from these designs by encoding the symbols 
independently with regular PAM constellations. In the remain- 
ing sections of the paper the focus is on constructing low ML 
decoding complexity designs and not on the construction of 
full-diversity achieving signal sets. 

III. New Asymptotically-Good, Multigroup ML 
Decodable Codes 

In this section, we first give a general procedure to construct 
multigroup ML decodable codes, and then using this proce- 
dure, we construct a new class of delay-optimal, full-diversity 
(7-group ML decodable codes for N — n2^^~^ antennas 
with 5 > 1 and n > g, having asymptotic normalized rate 
g2g-i ■ We also construct g-group ML decodable full-diversity 

STBCs for N = n2L^J, n > 1, and delay T ^ gN with 
asymptotic normalized rate 2?^- For g > 2, these are the first 
instances of high-rate g-group ML decodable codes reported 
in the literature. 



A. General construction procedure 

In this subsection, we give a general procedure to con- 
struct a full-diversity, delay-optimal y-group ML-decodable 
code, g > 1, starting from g sets of matrices. Mi C C^'^^', 
I = 1, . . . , g, that satisfy the following two conditions for each 
l = l,...,g. 

CI. The matrices in A/j are full -rank and linearly inde- 
pendent over R, 
C2. Ml can be partitioned as Si U Si, 
with |5;|>0 and |5;|=.g-l. With 
Si = {Ai,i, . . . ,Ai,i_i,Ai^i+i . . . ,Ai^g}, the 
following hold 
. A^jB + B^Ai^j = for all Aij e Si, B e Si 
and 

• ^5i^/j2 + Aj.^iji - for 1 < ji <h<g 
and ji,j2 ^ I- 

The sets Afi, I — 1,. . . ,g, need not be distinct. If the above 
conditions are satisfied by a set J\fg, then one can even use 

We proceed by introducing functions (7+, g_ and /; which 
are used to construct multigroup ML decodable codes starting 
from sets of matrices satisfying the conditions CI and C2. 
Let 5 be a finite, non-empty subset of C^^^^^ consisting of 
full-rank matrices that are linearly independent over M. Let 
C e C^2^^2 be any full-rank matrix. Define 

g+{S, C) = {diag{A, C)\A eS}U {diag{Ao, -C)} and 
C) = {diagiC A)\A eS}U {diag{-C, Ao)}, 

for some Aq e S. Although g+{S,C) and g^{S,C) depend 
on the choice of Aq in addition to S and C, the properties 
of g+{S,C) and g^{S,C) that this paper deals with, do not 
depend upon the particular choice of Aq. The results are valid 
for any choice of Aq £ S. Hence, for the sake of simplicity, 
we continue using this notation. 

Both (7_|_(5, C) and g^{S,C) are finite subsets of 
l^{N,+N2)x{N^+N2) ^j^jj cardinality |5| + 1 consisting of full- 
rank matrices. Further, if C and all matrices in S are unitary, 
then all the matrices in g+{S, C) and g-{S, C) are unitary. 

Proposition 2: The sets of matrices g+{S,C) and 
g-{S, C), are both linearly independent over M. 

Proof: Proof is given for g-^-{S,C). Proof for g^{S,C) 
is similar Since the set S is linearly independent, it is clear 
that {diag{A, C)\A G S} is linearly independent. It is enough 
to show that diag{AQ, —C) does not belong to the subspace 
spanned by {diag{A,C)\A e S} over R. If diag{Ao,-C) 
belongs to this subspace then, there exist a set of real numbers 
{^A\Ae S} such that 

diag{Ao,-C) = ^ ^Adiag{A,C). 

Aes 

Since 5 is a linearly independent set, for all A Aq, (^a — 0- 
Thus we have diag{Ao,C) — ^Aodiag{Ao, —C). However, 
since both Aq and C are full-rank and hence non-zero, such a 
^Aa does not exist. Hence, g+{S, C) is a linearly independent 
set. ■ 



Let d > 1 be any integer. Let A^i , . . . , Nd be positive inte- 
gers. Fix an / e {1, . . . ,d}. For each i e {1, . . . ,d} \ {I}, let 
d G C^' be full -rank. Let 5 be a finite, non-empty subset 
of C^'^^' consisting of full -rank matrices that are linearly 
independent over R. Define N = X^iLi ^i- We construct 
a finite set, Ci, . . . , G+i, . . . , Cd) C C~><^ as 

follows. Let Sq = S. For i — 1, . . . ,1 — 1, construct Si recur- 
sively as: Si = g-{Si-i, Ci). For i — I + \, . . . ,d, construct 
Si-i recursively as: Si^i — g+{Si-2,Ci). Finally, assign 

fl{S, Ci, . . . , C;_i, C;+i, . . . , Cd) — Sd-l- 

By the above construction and Proposition |2] we 
see that fi{S, Ci, . . . , C;_i, C;+i, . . . , Cd) consists of full- 
rank, R-Unearly independent matrices. If the matrices 
Ci and the matrices in S are unitary, the matrices in 
fi{S, Ci, . . . , C;_i, Ci+i, . . . , Cd) are also unitary. We note 
that \fiiS,Ci,...,Ci^i,Ci+i,...,Cd)\ = \S\+d-l. It is 
cleai- that the mati'ices in Ci, . . . , C;_i, C;+i, . . . , Cd) 
have a block-diagonal structure, with d blocks on the diagonal. 
For i G {1, . . . ,d}\ {I}, the i*'' block is composed of ±Ci. 
The l*^ block consists of matrices from S U —S. 

For / = 1, . . . , 5, let TV; C C^' ""^^ satisfy the conditions CI 
and C2 given earlier in this subsection. For each I ^ 1, . . . , g, 
define Mi = fi{Si, Ai,i, . . . , Ai^ij, Ai+ij, . . . , Ag,i). For 
each I, Ml C C^^^, where, N = YlLi ^i- 

Proposition 3: M — ^^^iMi contains full -rank, R- 
linearly independent matrices which satisfy 

B^B2 + B^Bi = for Bi e X,,, e Mi,, 

for I < h < h < g- 

Proof: The fact that the matrices are full-rank is clear. 
We now prove the Hurwitz-Radon orthogonality property. 

Let 1 < I < k < g and Bk € Mk and Bi e Mi. 
All the matrices in M have block diagonal structure, 
with the j*^ block as a matrix in C^^^^^. 
Thus we have Bi = diag{Bi[l], . . . , Bi[g]) and 
Bk = diag{Bk[l], ■ ■ ■ ,Bk[g]). Consider ji{l,k}. 
We have Bi[j] ^ ±Aj^i and Bk[j] = ±Aj^k- Thus, 
Bi [il^Bfc [j] + Bk [j]"Bi [j] = 0. Now consider j = I. 
We have Bi[j]eSiU-Si and Bk[]] = ±Aj^k- 
Thus, Bi[j]^ Bk[j] + Bk[j]" Bi[j] ^ 0. Similar resuh 
holds for j — k also. Thus for all j = 1, . . . , g, 
we have Bi[j]" Bk[j] + Bk[j]" Bi[j] = 0. Hence 
Bf Bfe + Bf B, = 0. 

We now show that the set M is linearly independent 
over R. Let span{Mi) denote the subspace of C^^^ 
spanned by the elements of Mi as a vector space over 
R. From Proposition IH it is clear that each Mi is a lin- 
early independent set. It is therefore enough to show that 
span{Mi) n span{Mk) ~ {0} for ^ 7^ A:. In order to prove 
this, let B G span{Mi) O span{Mk)- Therefore for some set 
of real numbers {^A\AeMk}, B = J2AeMk^A^- Since 
B e span{Mi) we have, for all A e Mk, B"A + A" B = 0. 
Hence, B^ B + B" B = 0. Thus, B = 0. ■ 

We construct an STBC C C C^^^, by using the matrices 
in M as the linear dispersion matrices of a design X. Let 
X = J^BeM^^B, where the real symbols xb are indexed 



by the elements of M. From Corollary |2] the design X, ob- 
tained via the proposed construction method, can be combined 
with appropriate regular PAM constellations to obtain a full- 
diversity, g-group ML-decodable STBC C. For I = 1,. . . ,g, 
the l*^ decoding group is {x^j^ S Mi}. The rate of the code 
in cspcu is i? = Since \Mi\ — \Si \ + g — 1, we have 

The following proposition will be useful when we are 
constructing high-rate multigroup ML-decodable codes in Sec- 
tion UlLBl 

Proposition 4: Let fc > 1. There exists an explicitly con- 
structable set Hk C C*^^*^ consisting of fc^ matrices that are 
unitary, Hermitian and linearly independent over R. 

Proof: For fc = 1, Hi ~ { [l] } satisfies all the properties 
in the hypothesis. Let us assume that k > 2. 

For a matrix A, let A{n,m) denote the entry in 
the n*^ row and m*'' column of A. For n = l,...,fc, 
define Z„ e C''^'' as follows. Zi ^ diag{l, 1,1, ■■■ ,1), 
Z2 = diag{l, —1, 1, . . . , 1), Z3 = diag{\, 1, —1, . . . , 1), . . , 
Zk = diag{l, 1,1,..., —1). For n = 1, . . . , fc — 1 and m > n 
define Vn,m € C*^^*^ as follows. 

Vn,m{n,m) = Vn^m{m,n) = 1 , Vn,m{l,l) ^ 1 fOT I ^ 71, m. 

All other entries of Vn^m are set to zero. For n — 1, . . . , fc — 1 
and m > n define Wn,m € C*"'^*"' as follows. 

Wn,min,Tn) = ~Wn^m{m,n) = i,Wn,mil,l) = I, I ^ n,m. 

All other entries of Wn,m are set to zero. We now prove that 
T-Lk — {Zn} U {Ki,m} U {Wn.m} Satisfies all the properties 
in the hypothesis. It is straightforward to see that every 
matrix in T-Lk is unitary and Hermitian. Further, |{^ri}| ~ k, 

\{Vn,ra}\ = and \{Wn,m}\ = ■ ThuS, \Hk\ = k\ 

It only remains to show that the set T-Lk is linearly indepen- 
dent over R. For n = 1, . . . , fc and m> n, let C„, an,m, Pn,m 
be real numbers such that 

fc fc-i 

n— 1 n— 1 m>n 

Then, for each 1 < p, q < n we have 

fe 

{Ctn,myn,m{p,q) Pn,7nWn,m{p,q)) 
fc 

+ ^C,nZn{p,q) = 0. 
n=l 

For p = 1, . . . , fc — 1 and q > p, we have Zn{p, q) = for all 

Vn,m{P,q) = Sn,pSm,q and Wn,m{p,q) = iSn,pSrn.q- ThuS, 

ap^q + = 0. Hence, we have ap,q = Pp^q = 0. Since the 
above sum reduces to X]f^=i CnZn, it only remains to show 
that Cn = for all n. J2n=i CnZn = is equivalent to the 



proposition that (Ci , . . . , (^fe)^ is in the null-space of the matrix 



1 

-1 
1 



= ll^-rfzag(0,2,...,2), 



1 1 1 ••• -1 
where 1 is the all ones vector Thus, we have 

= dm5(0,2C2,...,2a). 

Equating the components of the vectors on either side of 
the equation, we have = X]n=i Cn = 2C,„ for m > 1. Thus, 
= X]n=i Cn = Ci- This completes the proof. ■ 
Example 8: The set H2 C C^^^ consists of the following 
4 matrices: 




'1 0' 




'1 


0' 







1' 


and 


i 


1— ' 


1 





-1 




1 









Example 9: The set H3 C C'^'*'^ consists of the following 
9 matrices: 





-> 



B. Asymptotically-good, delay-optimal multigroup ML decod- 
able codes 

In this subsection we construct {Ni\l — !,...,(;} leading to 
a family of delay-optimal, asymptotically-good, (7-group ML- 
decodable codes, for g > 1. 

Theorem 2 ( [8^): The maximum rate of a square orthogo- 
nal design in K complex symbols is cspcu. 

From Theorem|2l we see that, there exists a square orthogo- 
nal design of dimension 2^^^ x 2^"^ in K complex symbols, 
i.e., in 2K real symbols. The linear dispersion matrices of 
a square orthogonal design are all unitary and are pairwise 
Hurwitz-Radon orthogonal. Thus, for m > 0, there exist 
2(771 + 1) unitary, K-linearly independent, pairwise Hurwitz- 
Radon orthogonal 2™ x 2™ complex matrices. Let us denote 
this set of mati'ices by Om = {Ui, . . . , U2m+2\- 

Let .9 > 1 be any integer. Let A^ = 7i2Lt~J and 
n > g. Then, ^ = p + e, where p > 1 is an integer 
and ee [Ojl)- Then, n — gp + ge, where gp is 
a positive integer and t — ge G {0, 1, . . . , g — 1}. 
Assign Ni = N2 = ■■ ■ = Nt ^ {p+ 1)2 and 
iVt+i = • • • = iVg = p2 J . Clearly, = ^■ 

Let Hp C CP^P and Hp+i C Cp+^''p+[ be the sets of 
matrices promised by Proposition |4] Let Hp+i be any subset 
of Hp+i of cardinality p^. The construction procedure is 
given in two cases below. 

Case 1, g is even: Here, g ~ 2{m + 1) for some 777, > 
and 777 — [^^J. With Ui as the weight matrices of the 



2™ X 2™ maximal rate square complex orthogonal design, for 
I — 1, ... ,t, define: 

Si = {Ip+i ® Ut\i = 1, . . . , 2t77 + 1} and 

Si = {A(E)U2r,i+2\Aenp+i}. 

For I ^ t + 1, . . . , g, define: 

Si — {Ip (8) Ui\i = 1, . . . , 2777 + 1} and 
Si = {A(E)U2,n+2\AeHp}. 

Assign Afi = SiUSi, I — 1, . . . , 5. It is straightforward to show 
that the sets of matrices {J\fi\l — 1,. . . ,g} satisfy the condi- 
tions CI and C2 of the construction procedure in Section lTlI-AI 
Number of real symbols in each group of the design resulting 
from the construction procedure in Section IIII-AI applied to 
the sets of matrices {Afi} is p^ + g — 1. The rate of the code 



g(p +g-i) 



2N 



R 



which is the same as 
g N 



i\ 1- 



2N 



cspcu. 



(3) 



Case 2, g is odd: In this case, g = 2m- 
With Ui as the weight matrices of the 2" 
square complex orthogonal design, for I 



-1, for some m> 1. 
X 2™ maximal rate 
= 1, . . . ,t, define: 



Si = {Ip+i ®Ui\i = I, . . . ,2m} and 
Si = {A(g, B\A e Hp+u B e {U2m+uU2m+2}}- 
For I = t + 1, . . . , g, define: 

Si = {Ip (g) Ui\i = 1, . . . , 277^} and 

Si = {A® B\A e-Hp^Be {C/2™+i, U2,n+2}}- 

Assign Ml = SiUSi, I = 1, . . . ,g. Again, it is straightforward 
to show that the sets of matrices {J\fi\l = l,...,g} satisfy 
the conditions CI and C2 of the construction procedure in 
Section lTlI-AI Number of real symbols in each group of the de- 
sign resulting from the construction procedure in Section lTlI-AI 
applied to the sets of matrices {Afi} is 2p^ + g — 1. The rate 
of the code is ^^^^2n'^^^ which is the same as 



R 



9_ 

N 



N 



52 



2N 



cspcu. 



(4) 



From (O, (HI and Corollary |2] for any n > g, we have a 
g-group ML-decodable, delay-optimal, full-diversity STBC C 

r72L^J, 



for number of transmit antennas N 

9 



R{g,N) 



N(l + l{g is even}) 



N 



32 



with rate 

2N 



(5) 



Every codeword matrix of C is block-diagonal, with g square 
complex matrices on the diagonal. The first t matrices are of 
dimension (p + 1)2L"S-J and the remaining g — t matrices 
are of dimension p2L'T~J. When N ~ pg2^^^^, p > I, i.e., 
when 77 = pg, we have e = t = 0. Thus, every codeword in C 
is block-diagonal with q complex matrices on the diagonal. 



each of dimension p2 L 



Example 10: Consider = 6 and g — 2. In this case, 
n = 6 and is a multiple of g. Hence, p = 3 and t — 0. 
Further, m — [^^J — and the maximal rate square orthog- 
onal design of size 2™ = 1 has weight matrices 
Thus, 82 — iV-s and Si ~ S2 ^ {h} where is 

given in Example |9] The two groups of weight matrices are 
Ml = /l(^H3,/3) and M2 = 12(^3, h). Since IH3I = 9, 
the number of real symbols in each of the two groups 
is 10. The rate of the resulting design is 5/3 cspcu. Let 
{xi, . . . , xif)} and {xn, . . . , 2:20} be the symbols associated 
with the weight matrices in A^i and A^2 respectively. The 
resulting 2-group ML-decodable code is given in (|6]l at the top 
of the next page, where the auxiliary variables in the design, 
Zj, j ~ 1, . . . ,8, are 



Zl = 


Xi + 


X2 + 


X3 + Xs 


+ Xg+ Xio 




Z2 = 


Xi - 


X2 + 


X3 + xe 


+ X7 + X10 




Z3 = 


Xi + 


X2 - 


X3 + X4 


+ X5 + Xio 




za = 


Xii - 


\- X12 


+ X13 + 


Xis + Xig -t 


- 2:20, 


Zb = 


xii - 


- Xl2 


+ Xi3 + 


X16 + Xi7 4 


- 2^20, 


Z6 = 


xii - 


F 2:12 


- Xi3 + 


Xl4 + Xi5 A 


- 2:20, 


Z7 = 




1 ^i' 


and zg = 


-T^° X- 




e 11. 


Let 


N = 


12 and 


5 = 3. We 


see that 



is a multiple of g. Thus, e = t — 0, p = j — 2 and 



m = [^-2^\ = 1. Thus, we are concerned with square or- 
thogonal design of size 2™ = 2. This is exactly the Alam 

outi code f30l, whose weight matrices are Ui = ^ ^ 



, C/3 = 
and A2 



I 


h ® U2 



and C/4 = 
With -H2 







-1 

1 
as given m 



IJ' 
Let 



i 
-i 
Ai = I2 «) Ui 

Example [8] let 5 = {A ® B| A e H2, S e {[/g, J74}}. Let 
TVi 7V2 = A/'a 5 U {^1,^2}- The three groups 
of weight matrices are then given by A^i = fi{S , Ai, A2), 
M2 = f2{S,A2,Ai) andTWa = /3(5, Ai, ^2). Each group is 
composed of 2p^ + g — 1 = 10 real symbols. The rate of this 
design is 5/4 cspcu. Associate the real symbols {xi, . . . , 2:10}, 
{xii, . . . , X20} and {2:21, • • • , 2:30} with the weight matrices 
in A^i, M2 and 7M3 respectively. For k = 1, 2, 3, define the 
following variables. 

Vk = 2;4+io(fc-i) + 2;7+io(fe_i) + i(2;3+io(fe_i) - 2;8+io(fc-i)), 
yk = 2;4+io(fc-i) - 2;7+io(fe-i) + i(2;3+io(fe-i) + 2;8+io(fc-i)), 

Uk —X5+io{k-l) + 2;6+10(fc-l) + 

*(2;i+io(fc-i) +2;2+io(fc-i) + xg+io(^k-i) +xiok), and 



Zk — 2;5+io(fc-l) — X6+io(fc-l) + 

*(2;i+io(fe-i) — 2;2+io(fc-i) +X9+io(fc-i) +2;iofc). 
Also define 

j=21 ~ ^29 +2:30 + 1 ( Lj = ll + - X20 
^2 = Sj=l ^3 - Xg+Xia+i {j2%21 + ^29 " 2:30^ & 
■^3 = Z]j=ll - 2^19 + 2:20 + i (Z]j=l ^3 +Xg- Xio 



The resulting design is given in (|7]i at the top of the next page. 

Example 12: Consider 20 and g = 3. is a multiple 
of 2Lt~J = 2. a full-diversity, 3-group ML decodable code 
can be constructed for = 20 using the procedure given in 
this subsection. From (|5]), this new code has rate i? = 3/2 
cspcu. 

We now show that the new class of g-group ML decodable 
codes, .g > 1, constructed in this subsection has asymptotic- 
normalized-rate g2fi-^ and hence is asymptotically-good. 
Thus, by increasing A^, we get g-group ML decodable codes 
with rates greater than 1 cspcu for any g > 1. 

Proposition 5: Let g > 1. The asymptotic normalized 
rate of the family of g-group ML decodable STBCs 
constructed in this subsection for number of antennas 
N e {n2L^J |n e Z, n > c/} is ^5?^. 

Proof: We prove the result for the case when g is odd. 
The proof for even values of g is similar. Consider the ratio 
of rate (|4]i to number of antennas A^, 



R 

N 



9 



27V2 



9 

iV2 



A^ 



<?2 



i\ 1- 

9 -1 



Since 




N = 


N \_ 


n 


n 


U2^J " 


9 j 


~ 9 



we 



have 



e^v, € [0, 1). Therefore, 



R 

N 



9-9 
2n229-i 
2 



229-1 



EAT 



9 



n^29 
2 



n^29- 



2nepf 



9 



1 



2e 



N 



34 



(8) 



n^29 g29-^ ng23-^ n?29-^ 

The limit N 00 corresponds to the limit n ^ 00. When 
n — > 00, in the summation ([8]l, only the second term does not 
vanish. This completes the proof. ■ 

C. Asymptotically-good, non-delay-optimal, multigroup ML- 
decodable codes 

In this subsection, we construct non-delay-optimal, full- 
diversity, (7-group ML-decodable codes for g > 1, using the 
codes in Section |IV] leading to higher asymptotic normalized 
rates. 

Let A be a square, block-diagonal, complex matrix with 
g square matrices {Ci,...,Cg}, each of dimension k y. k 
on its diagonal. Define 4>{A,g) = [Cf , . . . , Cj]^. Note that 
4>{A,g) is of dimension gfc x fc. If ^1, . . . , Ak are K such 
block-diagonal matrices that are linearly independent over M 
then, it is clear that, (f){Ai, g), . . . , (j){AK , 9) are also linearly 
independent over M. 

Let p > I and A^ = pg2^^^. Let C be a g-group 
ML-decodable, delay optimal STBC obtained from the 
construction in Section UlI-BI using a design X = J2iLi XiAi 
and a full-diversity achieving signal set A C M^. That 
is, C = {Y.f=i ■ ■ ■ ,^k) & A}. Every codeword 

in C is block-diagonal with g complex matrices on the 
diagonal, each of dimension p2L'T~J xp2L~2-J. From (|5]l. 



Xe 



zs + izi 

X5 + iX4 
XT + iX6 







—X5 + ixi 

Zs + iZ2 

xq + ixs 






— X7 + iX6 

— Xg + ixs 
Zs + iZ3 











Zt + 124 
Xi5 + ix-14 

Xi7 + ixie 







-Xi5 + iXi4 
Z7 + iZ5 
X19 + iXi8 







-xn + ixie 
"Xi9 + ixis 
Z7 + ize 



(6) 



X12 



= diag 



'Wl 











'•W2 


-U2 


Ul 











U2 


W2 





yi 


Wl 


-zl 


1 





2/2 


_Vl 





Z\ 


w\ _ 




V2 








W2 
Z2 



-V2 



-Z2 
W2 _ 





V3 








-V3' 









ya 




-Z3 





z-i 


^3 . 



(7) 



we have K ~ g2g-i +9^—9- Let ri,...,rg be subsets 
of {1, . . . , such that for j = 1, . . . , the j*'' decoding 
group is {xi\i £ Tj}. Consider a pg2^^^^ x p2Lt~J 
STBC C, given by C ^ {(t>{C, g)\C e C}. That is, 
C = {Lili • ■ • ,ax]^ e -4} which can 

be obtained from the design Y = ^^j^ 5) in real 

symbols yi, and the signal set A. 

Proposition 6: C is a g-group ML-decodable, full-diversity 
code of rate R' = g^^-i + ^JN^ cspcu. 

Proof: Since Ai, . . . , Ak are linearly independent, 
0(^1, g), . . . , (f>{AK, g) are also linearly independent over M. 
Thus, it is clear that the rate of the code in complex symbols 
per channel use is R'. 

Let 1 < ji < j2 < 9 and ii e Tj^, 12 € Tj^. 
For / = 1,2, let A^, = diag{Ai^[\\^ . . . , Aii[g]). Since 
and Xi^ belong to different decoding groups of the code C, 
we have Af^Ai^ + A^^Ai^ = 0. This implies that for each 
k = l,...,g, Ai^ [k]"Ai^ [k] + Ai^ {k\^ Ai^ [fc] = 0. Hence, 

<\)[A,, , g)"cp{A,, , g) + 4>{A,, , g f^iA, , g) 
= J2 (^n [k]"A, [k] + A, [kfA,, [k]) 

k=l 

= 0. 

Hence, for j = l,...,g, the j*^ decoding group is 
{Vil'i G Tj}. Since C is balanced, C is also balanced. It only 
remains to show that the code C offers full-diversity. Consider 
any two distinct codewords C( and C2 in C. There exist dis- 
tinct Ci , C2 e C such that, C( = ^(Ci , g) and = (;!)(C2 , g). 

AC = C[-C'2 = HCi,g) - 0(C2,5) = <I>{C - C2,g). 

For 1 = 1,2, let C; = dm.g(C; [1] , . . . , C; [2]). Let 
AC = Ci - C2 = dm5(AC[l], . . . , AC[g]). Since C offers 
full-diversity, we have, det{AC) =l\l^-^^det{AC[k]) 0. 
Hence, for each k = 1, . . . , g, det{AC[k]) 7^ 0. Thus, 

det{AC'"AC') = det {^AC[k]" AC[k]j ^ 0. 

This follows from the fact that, the sum of two positive definite 
matrices is also positive definite. We have thus shown that the 
STBC C offers full-diversity. ■ 



For each g > 1 and p > 1, we have constructed 
a g-group ML-decodable STBC for number of trans- 
mit antennas N'=p2^^~^, delay T' = gN' and rate 
R'{N') = + 9W' asymptotic normalized rate of the 
family of codes corresponding to a fixed g is r{g) = 2?^- 
The asymptotic normalized rate of these codes is better than 
those of the codes in Section |IV] by a factor of g. However, 
the delay is larger by the same factor g. 

Example 13: Consider the 3-group ML-decodable code 
Xi2xi2 0. Define Xi2x4 = 0(Xi2xi2, 3). Then, Xi2x4 
is a full-diversity, rate 5/4, 3-group ML-decodable code for 
4 transmit antennas with a delay of 12 channel uses. The 
code is comprised of 30 real symbols {xi, . . . , X30}. With 
the variables as defined in Example [TT] the resulting code is 
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-yi 








yi 
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W2 
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W3 
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D. Comparison with other Multigroup ML-decodable codes 

We consider only balanced multigroup ML-decodable codes 
available in the literature. A summary of comparison is 
given in Table |T] The delay T of maximum rate orthogo- 
nal designs is lower bounded by cr{N), where for to > 0, 
cr(2TO) cr(2TO- 1) = (^"\) liu. Delay-optimal 2-group 
ML-decodable codes in iflTl have asymptotic-normalized-rate 
r = 1/4, but were constructed only for 2™ antennas. We have 
constructed 2-group ML-decodable, delay-optimal codes for 
all antennas iV > 2 in Section IIII-BI For the particular cases 
of iV = 4 and N = 2™ these new codes have the same rates 
as that of the codes in |fT6l and ifTTl respectively. The 2-group 
ML-decodable codes of Section IIII-CI have the same rate as 
that of the codes in fl8 | for identical delay T = 2N. 

The new codes constructed in this section are the first in- 
stances of asymptotically-good g-group ML-decodable codes 
reported in the literature for g > 2. 



TABLE I 

Comparison of Balanced Multigroup ML-decodable codes 



Code 



Transmit 
Antennas 

N 



Delay 



T 



Groups 



Rate 
(cspcu) 
R 



Asymptotic 
rate 



Asymptotic 
normalized rate 



Square COU^ 



2™, m > 



N 



2RT 



m + l 
2'" 



Maximal Rate COD^ 



> 1 



at least 



2RT 



SSD Codes ^ 



2™, m > 1 



T 



RT 



CUWDs^ 0, fT4l 
Natarajan et. al. 1151 



2™, m > r| - 11 



N 



> 2 



9±1 I 



Dao et al. Q] 

de Abreu fTT\ 



2™, m > 1 



N 



Dao et al. [D 
Rajan et al. (31 



2m, m > 1 



N 



Yuen et al. [161 



5/4 



NA 



NA 



Srinath et al. U7|| 



2™, m > 2 



N 



JV I j_ 
4 ^ JV 



Ren et al. [HI 



> 1 



> 2N 



T 



1 for T = 2N 



New codes in Sec IIII-Bl 



g-i 

m2 L 2 J , m > g 



N 



> 2 



_i N 



N(l + l{g is even}) [^^^S^j 
2N 



g2S- 



New codes in Sec IIII-Cl 



m2LT~J, m > 1 



gN 



> 2 



N I g-1 
2S-1 ' 2N 



Division Algebra codes 121] 



> 1 



JV 



N 



t For m > 1, a{2m) = a{2m - 1) = 
Key : 1) COD - Complex Orthogonal Design, 2) SSD Codes ■ 
3) CUWDs - Clifford Unitary Weight Designs. 



Single complex symbol ML-decodable codes. 



IV. A New class of Fast-Group-Decodable Codes 

In this section, we construct a new class of FGD codes for 
all even number of transmit antennas and rates 1 < i? < 5/4. 
In lITSl . FGD codes for the same range of rates, but for a 
smaller class of antennas, number of antennas that are a power 
of 2, were constructed. The FGD codes of this section match 
the FGD codes of ifTSl in terms of ML decoding complexity 
when the number of antennas is a power of 2. All the codes 
considered in this section are delay-optimal. 

A. A new class of FGD codes 

We first construct a rate 5/4 FGD code for 2m antennas, 
m > 1. Codes of rates less than 5/4 are then obtained via 
puncturing. Let the number of transmit antennas N = 2m, for 

some positive integer m. Let X = 



and Z = 



l2,iX,iZ and ZX are the weight matrices of the 
code and are mutually Hurwitz-Radon orthogonal 

r*™ C C"'^™ as 



1 
-ij' 
Alamouti 
Define 



P,,, = {diag{l, 1, 1, . 

diag{l, 1, 



. ,l),djag(l,-l,l,...,l), 
-l,...,l),...,diag{l,l,l, 



-1)}- 



All the matrices in Dm are Hermitian and mutually 
commuting and |2?m| = ™- Let V — {l2,iX,iZ, ZX,il2}- 
Let Vi^ih} and V2 = {iX,iZ, ZX^ih}. For any 
ScC^""^, let S(g>Vm = {A(g)B\AeS,B eVrn}- The 
proposed N x N design is Xfgd = J^Aev-^v^ ^aA. 
Proposition 7: The rate of the design Xfgd is 5/4 cspcu. 
Proof: \r -S) Vrn\ = \V\\Vrn\ = 5m. The rate is 
^^j^ = 5/4 cspcu provided the set V <8) Vm is linearly in- 
dependent over M. It is therefore enough to show that both 
the sets V and 2?„i are linearly independent over R. Proving 
linear independence for V is straightforward once the matrices 
are vectorized. We now show that the set 2?.,„ is linearly 
independent. Since the elements of Dm are diagonal this is 
equivalent to showing that the following mxm matrix, whose 
m columns are the diagonal entries of the m matrices in Dm, 
is full-rank. 



111 

1 -1 1 
1 1 -1 



= 11^ -dtagi0,2,...,2), (9) 



where 1 is the all ones vector Let a ~ [oi, . . . , a™]-^ G R™ 
be in the null-space of (|9]l. Then we have, 

/ m \ 

1 ( ^ On 1 = diag{0, 2a2, . . . , 2a„i). 

Equating the components of the vectors on either side of 
the equation, we have = X]r=i = '^O'k for fc > 1. Thus, 
= ^™^]^ a„ = ai. This completes the proof. ■ 

We now show that the proposed code is fast- 
group-decodable. Let Xi = ^ Ai^Vit^v and 
^2 = I]agP2®23„ ^aA. We have that Xfgd = Xi + X2. 

Proposition 8: The design Xfgd is 2-group ML decod- 
able. The two groups correspond to the symbols in the designs 
Xi and X2 respectively. 

Proof: It is enough to show that for any A E Vi (E) 'Dm 
and any B Vm, we have A^B + B" A = 0. Equiva- 

lently, A^ B must be skew-Hermitian. A and B are of the 
form A ^ I2 ® Da and 5 = 5® Db, where S" G P2 
is skew-Hermitian, Da and Db are diagonal, commuting 
and Hermitian. Thus, A^ B = S (S) DaDb, which is a skew- 
Hermitian matrix. This completes the proof. ■ 

Proposition 9: The design X2 is conditionally 3-group ML 
decodable i.e. fast-decodable. 

Proof: It is enough to show that the design 
J2Ae{iX,iZ,zx}(sv^ ^aA, whose weight matrices form 
a subset of weight matrices of X2, is 3 group ML decodable. 
It is straightforward to show the matrices belonging to the 
three sets {iX} (g) 2?m, {iZ} (g) Dm and {ZX} (g) Dm are 
mutually Hurwitz-Radon orthogonal. ■ 

From Propositions [8] and |9] it is clear that Xfgd is 2-group 
ML decodable and one of the groups is fast-decodable, proving 
our claim that Xfgd is fast-group-decodable. The rate-1, 4- 
group ML decodable design given in [2] has the elements of 
{l2,iX, iZ, ZX} (g) Dm as weight matrices. The new design 
Xfgd contains the matrices in {i/2} ® I?m in addition to the 
weight matrices of the design in (2). 

Designs of rate 1 < i? < 5/4 can be obtained by puncturing 
appropriate number of symbols from the set of symbols 
corresponding to {i/2} ® 2?m- Similar to Propositions |8] and |9] 
it can be shown that the designs thus obtained are fast- 
group-decodable. The resulting design will be 2-group ML 
decodable, the first group containing symbols corresponding 
to Vi (E) and the second group containing the rest of the 
symbols. Again, the second group is conditionally 3-group ML 
decodable, the three conditional groups correspond to the three 
sets {iX} (g) T>,n, {iZ} (g) and {ZX} (g) 

B. ML Decoding Complexity of the new class of FGD codes 

We assume encoding using full-diversity achieving, regular- 
PAM constellation as promised by Corollary |2] Let the size 
of the underlying complex constellation used for encoding 
the proposed design of rate R, 1 < i? < 5/4, be M. Let 
the number of transmit antennas be = 2m. The group 
of symbols corresponding to Vi g) Dm can be ML decoded 
independently of other symbols. For each of the 
values that a subset of m — 1 real symbols of Vi g) Dm 



can take, the conditionally optimal value of the last real 
symbol can be found by simple scaling and hard limiting 
since its constellation is a regular PAM. The first group can 
thus be ML decoded with complexity M ~2~ . The second 
group consists of symbols that correspond to {iX} g) Dm, 
{iZ} g) Dm, {ZX} g) Dm and a subset S) C {ih} g) Dm of 
cardinality 4:m{R — 1). For each of the Af^™(^~^^ values that 
the symbols of T> can jointly take, the three groups of symbols 
corresponding to {iX} g) Dm, {iZ} g) Dm and {ZX} ® Dm 
can be independently decoded, each with complexity . 
Hence, the proposed code can be ML decoded with a complex- 
ity of M^/2-0.5 ^ ]^pm(R-l) . 3^m/2-0.5_ p^j. j^j-gg 

is approximately equal to 

3M^(4fl-3)-0.5_ (10) 

The FGD codes of ifTSl . constructed for number of antennas 
N that are a power of 2 and rates 1 < < 5/4, can be 
ML decoded with complexity 3A/t(4«-3)-o.5 j\iyj^^^ the ML 
decoding complexity of the new FGD codes of this section 
match the ML decoding complexity of FGD codes from ITSl 
when the number of antennas is a power of 2. 

V. A NEW CLASS OF ASYMPTOTICALLY-OPTIMAL, 
FAST-DECODABLE, FULL-DIVERSITY STBCS 

In this section, we first give a general procedure to construct 
high-rate, full-diversity FD codes using lower rate multigroup 
ML decodable codes and FGD codes. We then construct 
new STBCs for rates R > 1 and number of antennas > 1 
using the delay-optimal, asymptotically-good multigroup ML 
decodable codes of Section IIII-BI and the FGD codes of 
Section |IV] Finally we show that the new class of codes 
have the least ML decoding complexity among all the codes 
available in the literature for > 1 and R > 1. Among 
the new codes, the class of full-rate codes are asymptotically- 
optimal and fast-decodable, and for iV > 5, have lower 
ML decoding complexity than all other known families of 
asymptotically-optimal, full-diversity, FD codes. All the codes 
considered in this section are delay-optimal. 

A. A new class of low ML decoding complexity STBCs from 
multigroup ML decodable codes and FGD codes 

Let Xb = Y^fj^i ^iAi be an iV X iV design of rate 
Rb = Ki,/2N cspcu. We have the following propositions. 

Proposition 10: Let Rb < R < N and K = 2RN 
and B be an R-basis of C^^^. Then, there exist K - Kb 
matrices Ak^+i, ■ ■ ■ , Ak G B, such that the set {^1 , . . . , Ak} 
is linearly independent over R. 

Proof: Let B ^ {Ui, . . . ,U2n^}- Assign 
Ci^Ai, 1 = 1,..., Kb and Ci = Ui-Ki for 
i = Kb + I, . . . , Kb + 2N'^. Consider the set 
So = {!,..., K + 2N^}. For any S C So, define 
M{S) = {Ci\i € S}. Since S is a R-linear basis for 
C^x^, spam{M{So)) = C^x^. 

Find the smallest j £ Sq such that 
Cj £ spanR{{Ci, . . . ,Cj^i}). Such a j always exists 
because the set A4{Sq) is linearly dependent. Since Xb 



is a design, its weight matrices are linearly independent 
over R. Hence, Ci , . . . , Ck^ are linearly independent over 
M and thus we have j > Ki,. Let Si ~ Sq \ {j}. Thus, 
{1, ...,Kb)(lSi and span^iMiSi)) ^ C^^^. In general 
for i > 1, if M.{Si-i) is linearly dependent, find the smallest 
j g such that Cj G spanR({fc e 5i_i|fc < j}) ™d let 
S^ = \ {]}. It is clear that, span^{M{Si)) = C^^^ 
and {l,...,i4rb} C Si. Since 5o is a finite set, there 
exists a finite I such that M.{Si) is linearly independent 
over M while A^(5;_i) is not. Also, by construction, 
span^{M{Si)) ^ C^^^. Thus, M(Si) is a basis for C^^^ 
and M {Si ) contains Ai, . . . , A^^ ■ 

From the hypothesis of the proposition, K < 2N^ and 
K - Kb < 2N'^ - Kb. Label any of the K - Kb matrices 
from M{Si) \ {Ai, AkJ as Ak^^.^,. . . , Ak- This com- 
pletes the proof. ■ 

Proposition 11: Let k >1. There exists an explicitly con- 
structable R-basis Bk of C*^^*^, consisting of unitary matrices. 

Proof: Let T-Lk be the explicitly constructable set of fc^ 
unitary, Hermitian, R-linearly independent fc x fc complex ma- 
trices from Proposition!!] Let Bk ~ 'Hk^i'Hk- Let spanM.{Hk) 
denote the subspace spanned by elements of T-Lk over R. Bk 
is explicitly constructable since T-Lk is explicitly constructable. 
Also, \Bk\ = 2fc^. Since the dimension of C'^^'^ as a vector 
space R is 2fc^, it is enough to show that dim^{span^{Bk)) = 
2fc^, where (iirnR(-) denotes the dimension of a subspace over 
R. We have that, 

dim]R{spanR{Bk)) 

= dimm{spanR{'Hk)) + dim^{spanM.{iT-Lk)) 

- dimR{spanR{'Hk) n spanR{i'Hk)) 
= 2k^ — dim]gi{spanR{'Hk) H span-jgXi'Hk))- 

Thus, it is enough to show that 
dimTgi{spanfi{'Hk) H spans.ii'Hk)) = 0. Let A belong to 
the intersection of the two subspaces spanTR{Hk) and 
span^iiHk). This would mean that A is both Hermitian and 
skew-Hermitian. Thus, A = A^ = —A. Hence, A = 0. This 
completes the proof. ■ 

The result of Proposition [TO] together with the R-basis Bk 
promised by Proposition [TT] shows that any design Xb, with 
rate Rb < N, can be extended by adding sufficient number of 
real symbols with full-rank weight matrices. The new design 
from Proposition [TOl is X ~ ^i-^i- We say that Xb is 

the base design of X. 

Consider the case when all the weight matrices of 
the base design Xb are full-rank. Then, all the weight 
matrices of the design X are also full-ranked. For any 
given Q, from Corollary |2] there exist Q-ary regular 
PAM constellations Adi,QT ■ ■ TAdK,Q with minimum Eu- 
clidean distance di , . . . , respectively, such that the STBC 
C(X,^dj_Q X ••• X AdK-o) of full-diversity. Let M be 
the size of the underlying complex constellation used, i.e., 
M — Q^. ML decoding can be performed by finding the 
optimal values of xi, . . . .xk^ for each of the m'^"^'')/^ 
values that xk^^^ , ■ ■ ■ , xk jointly assume and then by choos- 



ing the tuple xi, . . . , xk that optimizes the ML metric among 
the m(^^^I')/2 tuples from the first step. The problem 
of optimizing xi,...,XKt, for a given set of values for 
Xif^^j , . . . , is equivalent to the problem of decoding 
the STBC C{Xi„Ad^^Q -x ■ ■ ■ X AdK,,Q)- Thus the STBC 
C{X., Adi,Q X • • • X AdK.q) can be ML decoded with a com- 
plexity given in (fTTT l at the top of the next page, where 
M L-decoding-complexity{-) denotes the ML decoding com- 
plexity of an STBC. Further if C{'Kb,Adi,Q x • • • x AdK^,Q) 
were multigroup ML decodable or fast-group-decodable, 
the proposed code C(X, ^^i.Q x • • • x AdK,Q) will be fast- 
decodable. 

Example 14: Consider the FD code for = 4 and 
i? = I cspcu constructed in ll28l . There are 12 real sym- 

12 

bols in this design X. = ^^^^XiAi. The base design 
Xb = X]i=i XiAi is the maximal rate 4x4 complex or- 
thogonal design [8,]. The matrices At,...,Ai2 are cho- 
sen as = AiA' for i = 1, . . . , 6, where A' G C'*^'' is 
a full-rank matrix chosen to maximize the diversity and 
coding gain. The real symbols are encoded independently 
from a regular PAM constellation of unit minimum Eu- 
clidean distance, Aq-pam- Since Xb is an orthogonal de- 
sign, C{Xh,AQ-PAM X ■ ■ ■ X Aq-pam) can be ML de- 
coded with 0(1) complexity. Hence, from ( fTTT l. the FD STBC 
C(X-, Aq-pam X ■ ■ ■ X Aq-pam) can be ML decoded with 
a complexity of the order of M^. 

From the above discussion, it is clear that new FD codes 
of rates R > Rb can be constructed using already known 
multigroup ML-decodable and FGD codes of rate Rb- When 
we are interested in rates R < Rb, low ML decoding complex- 
ity codes can be obtained by puncturing known multigroup 
ML decodable and FGD codes of rate Rb- In this case too, 
we say that Xb is the base design of the new code. It 
is straightforward to show that the order of ML decoding 
complexity, when the base design is g-group ML decodable 
and R < Rb, is 

M(rfi-i)/2. (12) 

B. Explicit constructions and their ML decoding complexity 

In this subsection, we explicitly construct STBCs with low 
ML decoding complexity (multigroup ML decodable, FD and 
FGD) for rates greater than 1 cspcu and > 1 transmit 
antennas. To do so we start from base designs of rate 1 cspcu 
or higher. From ( fTTT ) it is clear that for the constructed code 
to be of low ML decoding complexity, the base design itself 
should be of low ML decoding complexity. To the best of 
the authors' knowledge, multigroup ML decodable and FGD 
codes from among the following three classes of codes offer 
the least known ML decoding complexity for rates i? > 1 and 
number of antennas A^ > 1. 

• J^DAST- 2-group ML decodable, full-diversity. Diagonal 
Algebraic Space-Time (DAST) block codes from ||32l for 
any number of transmit antennas A^ and rate R = 1 cspcu. 

• ^g,AG, ff > 1- 5-group ML decodable, delay-optimal, 
asymptotically-good STBCs in Section lTlI-Bl of this paper 
for number of antennas, A^ = n2L~2-J, n> g. 



j^j(K-Ki,)/2 ^ ML-decoding-complexity (C(Xb, y4^di,Q x • • • x Ad^ ,q)) (H) 



TABLE II 

Exponent of Mt in the order of ML decoding complexity of the new STBCs and their associated base designs 



Transmit 
Antennas 


Rate, R in cspcu 


5/4 


2 


3 


4 


5 


6 


7 


8 


9 


10 


2 


0.5 

J^FGD 


2 

J^FGD 


















3 


2 

J^DAST 


4 

J^DAST 


7 

J^DAST 
















4 


1.5 

J^FGD 


4.5 
J^FGD 


8.5 

J^FGD 


12.5 

J^FGD 














5 


3.5 


7 

J^DAST 


12 
J^DAST 


17 

J^DAST 


22 
J^DAST 












6 


2.5 

J^FGD 


6.5 

"^2, AG 


12.5 

■^2, AG 


18.5 
-^2, AG 


24.5 
-^2, AG 


30.5 

-^2, AG 










7 


4 

J^2,AG 


8.5 

■^2, AG 


15.5 
.^2, AG 


22.5 

-^2, AG 


29.5 

-^2, AG 


36.5 

-^2, AG 


43.5 

-^2, AG 








8 


3.5 

J^FGD 


7.5 

"^2, AG 


15 
"^2, AG 


23 
-^2, AG 


31 

-^2, AG 


39 

-^2, AG 


47 

-^2, AG 


55 

•^2, AG 






9 


5.5 

-^2, AG 


9 

-'^2,AG 


18 
-'^2,AG 


27 
-^2, AG 


36 

-^2, AG 


45 

-^2, AG 


54 
-^2, AG 


63 

-^2, AG 


72 

-'^2,AG 




10 


4.5 

J^FGD 


9.5 
■^2, AG 


16.5 

"'^2,AG 


26.5 

-^2, AG 


36.5 

-^2, AG 


46.5 

-^2, AG 


56.5 

-^2, AG 


66.5 

"'^2,AG 


76.5 

-'^2,AG 


86.5 

"^2, AG 



t AI is the size of the underlying complex constellation. 



• J'FGD- FGD codes for even values of N and rate 5/4 
cspcu in Section HVl of this paper. 

The codes in Ffgd and Fg^AG, <? > 1, have full-rank 
weight matrices. We now show that the codes in Fdast can 
be obtained by using designs with full-rank weight matrices 
and by independent encoding of real symbols using Aq-pam- 

Proposition 12: Let Co G J^dast be a code for N 
transmit antennas that uses an underlying complex 
constellation of size Q^. There exists an x 
design Xq with full-rank weight matrices such that, 
Co = C{X.o,Aq-pam X • • • X Aq-paai)- 

Proof: In [32], the full-diversity STBC 
Co was obtained by using the design 
X = diag{xi + ixN+i,S:2 + ixN+2i ■ ■ ■ ,xn + ix2N)- 
The variables [aii, . . . xjy]"^ and [x^^^i, . . .X2n]^ are 
encoded independently using the N dimensional signal set 
A = {[/[fli, . . . , ajvl^lfli e Aq-pam-, j = 1, . . . , N}, where 
U £ R^^^ is an orthogonal matrix. Let U ~ [uiU2 ■ ■ ■ mjv] 
and [xi+i, ■ . .,XN+i]'^ = U^^[xi+i, . . .,XN+i]'^ 

for i = 0,N. Further, let Ak = diag{uk) for 
k — 1, . . . ,N and Ak = i ■ diag{uk-N) for 
k = N + 1, . . . ,2N. It is straightforward to show 
that Co C(Xo, Aq-pam x • • • x Aq-pam), where 
Xq = Y!H=i^i^i- Since Cq has full diversity, we have that 
CAk, fc = 1, . . . , 2iV, is full rank for C G AAq-pam \ {0}. 
Thus, all the weight matrices of the design Xq have full-rank. 

■ 

Suppose we are interested in constructing a low ML de- 
coding complexity STBC for number of antennas N and rate 



R cspcu, where R > 1. The base design for this code are 
to be chosen from the designs in J-fgd, ^dast and J-g^AG, 
g > 1, that are of size N x N. The set of such designs is finite 
and non-empty. It is a non-empty set since J-dast has one 
N X N design for each N > 1. Now, for each of the designs 
in this set, we compute the complexity of ML decoding the 
code of rate R that uses this design as the base. This can be 
computed using (|5]l ( fTOl ). (fTTl l and (fT2l i. Then, from among 
these designs, we pick that design as the base which leads to 
a rate R code of least ML decoding complexity. 

Table |ll] gives the achievable order of ML decoding com- 
plexities and the associated base design for a set of rates R 
and number of antennas N — 2, . . . , 10. These values were 
found through a computer search. A computer search for this 
problem is easy to implement and executes quickly. For N = 2 
and 4 antennas, using the new FGD design of Section |IV] as 
the base gives the least known decoding complexity for any 
rate 1 < R < N and not just the rates discussed in the table. 
For AT = 3, 5 and TV = 7, 9, the DAST codes and the 2-group 
ML decodable codes of Section ITlI-BI respectively, lead to the 
least known ML decoding complexity for rates 1 < R < N. 
For iV = 6, 8, 10, one should use the new FGD designs from 
Section HV] for 1 < i? < 3/2 and the 2-group ML decodable 
designs of Section UlLBl for 3/2 < i? < iV. 

Example 15: We now give a rate, i? = 2, = 6 antenna 
design with full-rank weight matrices. When the real symbols 
are encoded independently using regular PAM constellations, 
the order of ML decoding complexity of the resulting STBC 
is M^-^ . The order of ML decoding complexity of codes 



Xfd 



zs + izi 

Xs + iX4 
XT + iXf, 
X2\ + iX22 



X23 + iX2i 



— X5 + iXA 
Z8 + iZ2 
Xg + ixs 
X23 + iX24 
X21 + iX22 





— X7 + iX6 
~Xg + iXs 
Zs + iZ3 



2:23 + iX24 
X21 + iX22 



X21 + iX22 



X23 + iX24 

Z7 + 124 
Xi5 + ixi4 

xn + ixi6 



X23 + iX24 
X21 + iX22 



-X15 + iXi4 
Z7 + iZ5 
Xig + ixis 





X23 + iX24 
X21 + iX22 

—Xn + ixi6 

-Xig + 1X18 
Z7 + iZ5 



(13) 



in ||20| and [|27 l for these parameters are M^-^ and 
respectively (see Table HIIl l. For the new code, number of 
real symbols K = 24, Kf, = 20 and the base design is the 
rate-5/3, 2-group ML decodable code Xexe dSll constructed 
Section IIII-BI (see Example [TOl i. Denote the rate-2, FD 
design by Xfd = ^iAi and let the base design be 

Xexe = X]i=i ^i^i- The design Xfd is given by ( fTsT l at the 
top of this page, where the auxiliary variables in the design, 
Zj, j = 1, . . . , 8, are as given in Example [TOl 

C. Comparison of ML decoding complexities 

We now compare the ML decoding complexities of the 
STBCs constructed in this paper with other low ML decoding 
complexity codes available in the literature. In this sub-section 
we show that the ML decoding complexities reported in this 
paper are the least known in the literature. 

1) Comparison with codes from l il^ : In [[15], full-diversity, 
FD codes were constructed for N = 2™, m > 1, antennas and 
rates R > 1 and these codes use, as base designs, a family of 
rate-5/4 FGD codes that have the same known ML decoding 
complexity as the codes from Ffgd- For N = 2,4, and 
R > 1, the codes in ifTSl have least known ML decoding 
complexity in the literature and the new codes in Section [V-BI 
use codes from J-fgd as base for all i? > 1. Hence, for 
= 2,4 and 1 < R < N, the new codes match the least 
known ML decoding complexity in the literature. 

The new codes of Section I V-BI either match or have lower 
ML decoding complexity than the codes in ifTSl for > 8. 
The fact that the new codes match the complexities is clear, 
since one family of base designs J-fgd has same ML de- 
coding complexity as the base designs of codes in IfTSl . We 
now show that the new codes have strictly lower ML decoding 
complexity for all i? > 3/2 and all > 8 that is a power of 
2. 

Proposition 13: Let iV = 2™, m > 3, and 3/2 < i? < N. 
The new codes for N antennas and rate R of Section IV-BI 
have lower ML decoding complexity than the corresponding 
codes of ifTSl . 

Proof: It is enough to show that a rate R code 
that uses the design from J-2,ag as its base has strictly 
lower ML decoding complexity. We prove the result when 
3/2 < i? < f + The proof for i? > f + ^ is similar 
and hence is omitted. Complexity of ML decoding the code 
that uses the design from J-2,ag as its base can be shown 
to be M~~2 for R < ^ + j^- Complexity of the code 
from ifTSll is M~('*^~'^)~2 . Hence, it is enough to show 
that ^(4i? — 3) > But this is indeed the case whenever 
R > 3/2 cspcu. This completes the proof. ■ 



The new codes of Section FV-Bl also give lower ML decoding 
complexity for any 1 < R < 3/2 when compared with codes 
from (|T5 |, but only for sufficiently large values of N. 

Proposition 14: Let i? G (1,3/2]. There exists a positive 
integer N' such that for all N > N' that belong to the 
set {2™|r7i e Z, m > 0}, the new codes of Section IV-BI of 
rate R and number of antennas N have lower ML decoding 
complexity than the corresponding codes of [15|. 

Proof It is enough to show that those codes that use de- 
signs in J^5^AG as base designs satisfy the result of the proposi- 
tion. Let g ^ 5 and choose an N' e {2LV-J+^|^ > \log2{g)~\} 
large enough such that: (1) N' > 20 /R, and (2) for all values 
of e {2L-^J+^|^ > \log2{g)]} that are greater than N', 
the rate of the g-group ML decodable code from Tg,AG of 
size N X N is greater than 3/2. There exists such an A^' since 
the class of codes J^g.AG is asymptotically-good, and hence 
their rate tends to infinity as the number of transmit antennas 
goes to infinity. Now choose any A^, a power of 2, which is 
greater than A^'. Consider the rate R code obtained by using 
the N X N design from J-q^ag as the base. The rate of the 
base design > 3/2 and hence the new code is obtained 
via puncturing and will have an ML decoding complexity 
of ' 29 1 2 ^ where K — 2RN. This complexity can be 
upper bounded by Af 2^+2. Since N > N' > 20/R, we have 
K = 2RN > 2RN' > 40. The codes of rate R for A^ antennas 
from (15) can be ML decoded with complexity A/~'^-'^^'^)^2 . 
Since i? > 1, we have, f(4i?-3) = f- |f >f-^ 
= Hence, it is enough to show that || + 5 < ~ 5- 
Since, g = 5, this is equivalent to the condition that K > 40, 
which we have already shown to be true. This completes the 
proof. ■ 

The value of A^ beyond which the new codes have lower 
ML decoding complexity for a given i? e (1,3/2] is difficult 
to derive analytically, but can be found out easily through a 
computer search. For example, when i? = 5/4 cspcu, the new 
codes of Section IV-BI have lower ML decoding complexity, 
compared with the codes from IfTSl , for all A^ > 32. 

2) Comparison with TAST codes [20^: 

In fSO], Threaded Algebraic Space-Time (TAST) codes were 
constructed for all number of transmit antennas A^ > 1 and 
rate R — N. These codes can be constructed by using the 
DAST codes as the base design. Hence, the TAST codes are 
conditionally 2-group ML decodable, i.e., fast-decodable. This 
low ML decoding complexity property of the TAST codes was 
not reported in [20 1. 

We see that the new codes of Section IV-BI and the TAST 
codes have same decoding complexities for A^ = 3, 5 and 
any R> \, since both use the DAST code as their base (see 



TABLE III 

Comparison of ML Decoding Complexities: Exponent of Mt in the order of ML decoding complexity 



Transmit 


Rate 


New 


FD*, FGD* 


EAST* 


TAST* 


FD 


FGD 


FD 


FD 


Golden 


Silver 


Antennas 




codes 


codes 


codes 


codes 


codes 


codes 


code 


code 


code 


code 


TV 


R 




in [15] 


from 1271 


(20l 


in |22| 


in 151 
111 


in |28] 


in |29l 


E31 


l25l l26l 


2 


1 








0.5 


0.5 
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2 




2.5 


2.5 








2.5 


2 
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0.5 


0.5 
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1.5 
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3/2 


2.5 


2.5 




3.5 
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4.5 


4.5 
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5.5 


4.5 


5.5 
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17/8 
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5 




6 
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1.5 


2.5 
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6.5 
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8.5 
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12.5 




14 


14.5 
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8.5 
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15.5 
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1.5 


1.5 
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3.5 














8 
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7.5 


9.5 


10 


11.5 
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15 


17.5 


18 


19.5 
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23 


25.5 


26 


27.5 
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4 






4 














9 


2 


9 






13 














3 


18 






22 
















4 


27 






31 















M is the size of the underlying complex constellation. 

Key: (1) FD: Fast-decodable, (2) FGD: Fast-group-decodable, 

(3) EAST: Embedded Alamouti Space-Time. (4) TAST: Threaded Algebraic Space-Time. 



Table We now show that for any > 6 and R > 1 the 
new codes have lower ML decoding complexity than TAST 
codes. 

Proposition 15: The new codes of Section [V-B I for iV > 6 
and rates i? > 1, have lower ML decoding complexity than 
the corresponding TAST codes. 

Proof: For the TAST code of size N x N, the base design 
is the iV X iV DAST code with parameters Kb = 2N and 
g = 2. Also, it is clear that the new code of Section [V-BI will 
have ML decoding complexity less than or equal to that of the 
code generated by using the N antenna design from J-2,ag 
as the base. It is straightforward to show that for > 5, 
the number of real symbols in the design from J-'2,ag, say 
K'ij, satisfies > 2N. From ( fTTT ). the new code gives lower 
decoding complexity if 

This condition is equivalent to Ki, < K^. Since this is indeed 
the case, we have proved the required result. ■ 
3} Comparison with other codes: In |27|, FD codes, called 
Embedded Alamouti Space-Time (EAST) codes, were con- 
structed for rates i? = 1, 2, . . . , N/2 cspcu for N = 2, 4, 6 
and 8 antennas. Table |lll] shows that the new codes of 
Section lV^ have lower ML decoding complexity in each case. 
The EAST codes, however, have non-vanishing determinant 
property. In |5|, an FGD code for 4 antennas and rate 17/8 
was constructed and this code was punctured to obtain a rate 
2 code for 4 antennas. From Table |III1 we see that the new 



codes of Section I V-B I have lower ML decoding complexity 
than the codes in fSj. In |[28l, a rate 3/2 cspcu, 4 antenna 
code was constructed with complexity (see Examples |5] 
and [T4l) . The new code of section I V-B I has complexity only 
A'P-^. The rate 2 code for 4 antennas constructed in 1291 has 
an ML decoding complexity of order and non-vanishing 
determinant property. The new code that we report for = 4 
and R = 2 has a considerably lower ML decoding complexity 
of the order of A/^-S. The new codes match the code from f22| 
in decoding complexity for N = 4, R = 2 and have lower 
decoding complexity for N — 2, R = 2. The new N — 2, 
R = 2 code has same decoding complexity as the Silver 
code ||251 . 12611 and the new code has lower ML decoding 
complexity than the Golden Code l24l . 

From the above discussion it is clear that for N < 5 and 
R > 1 the complexity of ML decoding the new codes of 
Section IV-BI match the least known ML decoding complexity 
in the literature. When N > 6, N not a power of 2, and 
R > 1, the new codes have lower ML decoding complexity 
than the codes available in the literature. When iV > 8, a 
power of 2, and R > 3/2, again the new codes have lower 
ML decoding complexity than already known codes. When 
A^ > 8, a power of 2 and 1 < i? < 3/2, the new codes either 
have lower ML decoding complexity or match the least known 
ML decoding complexity in the literature. Thus, the subset of 
the new codes from Section IV-BI corresponding to R = N 
are asymptotically-optimal, fast-decodable and have least ML 
decoding complexity among all known full-rate, full-diversity 



codes in the literature. 

VI. Discussion 

In this paper, we have constructed new classes of 
asymptotically-good, full-diversity g-group ML decodable 
codes, g > 1 and new FGD codes. Using these codes we have 
constructed STBCs of least known ML decoding complexity 
for iV > 1 and R > 1 which include a class of asymptotically- 
optimal full-diversity, fast-decodable codes. The results of this 
paper lead us to the following questions. 

• Determining the highest possible rate of full-diversity, 
multigroup ML-decodable codes. 

• Is there an algebraic theory behind the constructions of 
multigroup ML decodable codes given in this paper? Are 
there other high-rate multigroup ML-decodable codes? 
Can improvements be made on rates reported in this 
paper? 

• Is it possible to derive a lower bound on the ML decoding 
complexity of the class of full-diversity STBCs of a given 
size N and rate i?? 

• How can one explicitly construct signal sets, that provably 
lead to full-diversity and large coding gain, given a design 
with full-rank weight matrices? 

• Proving the non-vanishing determinant property, either 
in the affirmative or otherwise, of the new class of 
asymptoically-optimal, fast-decodable codes proposed in 
this paper remains an interesting direction to pursue. 
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